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Abstract ; , 

An automated text generation system may be used to generate financial reports based on 
financial data. The text generation system uses decisional statements in conjunction with 
a base equation, and the decisional statements are evaluated using the base equation to 
determine a resulting logic path. The logic path may then be used to specify appropriate 
text fragments reflecting analytic conditions. The text fragments may include variables 
to vary the text content of the report for aesthetic and analytic purposes, which are 
evaluated at run-time. A variety of analytic conditions may be represented in a flexible 
manner. 
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SYSTEM AND METHOD FOR GENERATION OF TEXT REPORTS 

Background Information 

In the financial industry, it is desirable to disseminate financial data (earnings, stock 
prices, etc.) in a quick and accurate manner. It is also desirable that the format in which 
this information is disseminated is easy for people working in the financial industry (for 
example, brokers and analysts), as well as the individual investor, to quickly understand. 
As a result, such financial information is typically provided in the form of "investment 
reviews" or similarly titled textual reports. These reports provide textual summaries and 
analysis of the financial data presented, and may also provide graphical or tabular 
representations of that data. For example, a report on company ABC may report the 
current stock price and quarterly earnings, may also report the earnings growth or rate of 
return over a selected time period (e.g., annually), and may further provide qualitative 
commentary as to how the company is performing. The person reading the investment 
review may then draw conclusions from the analysis given by the report, for example, 
whether to buy, sell or hold company ABC securities. 

Because such investment reports are textual in nature, they are typically generated by one 
or more people who gather the necessary information, perform appropriate calculations 
and then compose the textual summaries that provide the analysis of financial 
performance levels. However, using people to generate these reports has many 
drawbacks. It takes time to generate these reports, even with the aid of electronic 
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publishing and spreadsheet tools. Market reaction to financial data is often instantaneous, 
and the sooner "raw" financial information can be digested and made available to 
financial professionals the better. Furthermore, thousands of companies around the world 
are traded publicly, and therefore the amount of manpower needed to generate financial 
5 summaries — which is directly proportionate to the number of companies to be reviewed 
- is substantial. 

Attempts have been made to automate the process of generating such financial reports. 
For example, Research Data Group, Inc. (of San Francisco, California), has provided 

l o investment reports on mutual funds generated via a computer program that calculates 
various financial benchmarks based on supplied financial data, and provides textual 
summaries based on these calculations. This text provides the reader with descriptions of 
the relative performance of the subject fund versus its peer funds and versus a market 
index such as the S&P 500. It also provides text that identifies investment sectors and the 

1 5 percentage each sector represents of the fund's portfolio, as well as giving the reader a 

textual interpretation of statistics that quantify the risk in the portfolio relative to the risk 
being assumed in the portfolios of peer funds. 

This program for mutual funds, however, was found to be inadequate for dealing with 
20 common real-world situations involving the various types of securities and corporate 

statistics. The mutual fund program used traditional computational logic methods, which 
were part of a "hard-coded" decisional structure (implemented using Microsoft Visual 
Basic). Thus, this program could not be adapted, for example, to varying security types, 
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corporate financial information, or analytical computations. This program also could not 
be adapted for varying textual responses across different data ranges or different security 
types. 

Furthermore, this program could not adequately deal with complex situations such as 
transitions from positive to negative quantities (and vice versa), evaluations involving 
zero quantities, and evaluations involving quantities converging to zero. These situations 
are not uncommon when evaluating corporate financial data: earnings may be positive or 
negative (or zero); return on investment may be negative or zero; earnings may transition 
from positive to negative over the course of a fiscal year (and vice versa); price-to- 
earnings ratios may experience large changes as earnings expand or contract close to 
zero; as well as other possible situations. 

Summary of the Invention 

An exemplary embodiment according to the present invention is implemented as a text 
generation system. The exemplary text generation system may include a memory system 
that contains raw data, text data, and a number of decisional statements which each 
include at least one operand and a number of result groupings; as well as a text generator 
operatively coupled to the memory system to produce a text report using the decisional 
statements, the raw data and the text data. 
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The text generation system generates text output by evaluating a base equation using the 
operand(s) of a decisional statement to obtain an equation result. The text generation 
system then selects one result grouping out of the number of result groupings of the 
decisional statement based on the equation result, and determines a logic path based on 
the selected result grouping. The system further selects a text fragment from the text data 
based on the logic path, and then generates an output that includes the text fragment. 

The text generation system implementation according to the present invention may be 
beneficially employed to quickly generate textual reports that are easily readable by those 
interested in interpretations of raw data. For example, financial reports may be created 
from financial data (e.g., stock prices, earnings figures) in a format that is easily readable 
s by interested investors. The ability of the exemplary text generation system to handle 
complex analytical cases such as negative-positive transitions of quantities and values 
approaching or equal to zero allows the text generation system to be employed in a 
variety of real world situations where flexibility and adaptability are important 

1 0 Furthermore, such reports can be generated almost instantaneously — as fast as the text 
generation system can access the raw data, compute the various equations and select the 
proper text elements. Accordingly, large numbers of reports can be generated in a short 
period of time, and reports can be provided on a nearly real-time basis. Further 
efficiencies may be achieved where certain report types are in a standardized format, such 
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that similar decisional statements and text fragments can be reused with different raw 

data files to produce multiple reports. 

Brief Description of the Drawings 

Figure 1 shows a block diagram of an exemplary embodiment of a text generation system 
according to the present invention. 

Figure 2 shows a first flow chart of the operation of the exemplary text generation system 
of Figure 1, according to the present invention. 

Figure 3 shows a portion of an exemplary data file for use by the exemplary text 
generation system of Figure I, according to the present invention. 

Figure 4 shows a portion of an exemplary map file for use by the exemplary text 
generation system of Figure 1, according to the present invention. 

Figure 5 shows a portion of an exemplary text data file for use by the exemplary text 
generation system of Figure I, according to the present invention. 

Figure 6 shows a portion of an exemplary output listing file provided by the exemplary 
text generation system of Figure 1, according to the present invention* 
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Figure 7 shows a second flow chart of the pperatipn of the exemplary text generation 
system of Figure I, according to che present invention. 

Detailed Description 

According to the present invention, a computer application may be implemented that 
generates human-readable summary text reports from raw numerical or other data. For 
example, an embodiment according to the present invention may be used to generate 
financial summary reports based on raw financial data, such that these financial reports 
may be provided to financial industry professionals or other interested persons on a 
current or even real-time basis (such as, for example, by electronic delivery). The present 
invention may also be used to implement text generation systems for other fields where 
such text generation is desirable. 

According to the present invention, the exemplary embodiment may be provided with 
flexible data evaluation and text selection features. Data evaluation may be achieved 
using a common "base equation", and by specifying a series of "decisional statements" 
which utilize the base equation. The base equation may be evaluated using operands 
listed in a decisional statement, and a "logic path" may be determined based on the result. 
The logic path may then be used to select a collection of text fragments corresponding to 
the logic path. Text may be generated by selection of individual text fragments within 
the collection of text fragments, according to a "text template" for the collection. The 
text fragment collections may specify a number of selection options, which may be 
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structured in a manner that allows the composition of natural and grammatically correct 
text. 

Figure 1 illustrates an overall system architecture of an automated financial text 
5 generation system 1 in accordance with an exemplary embodiment of the present 

invention. A text generator 10 is operatively coupled to a memory system 11, which may 
include one or more memory devices (RAM, ROM, disk, or other storage format). The 
text generator 1 0 may include a computer program with one or more processors (e.g., 
microprocessors) or other computational elements. Financial data 12 are provided in the 

10 memory system 1 1 in a particular format (e.g., a common spreadsheet format, comma 
delimited text or other format) such that the text generator 10 may parse the financial 
data for the data that are needed for use in generating a summary financial report 20. 
Financial decisional statements 14 are also provided in the memory system 1 1 in a 
particular format (which will be described further below) such that the text generator 10 

1 5 may determine those calculations that should be performed in generating the report, and 
the conclusions that should be drawn from those calculations. Text data 16 are also 
provided in the memory system 1 1 , to provide a library of text fragments that will used to 
generate the report 20. 

20 The financial summary report 20 generated by text generator 10 may be in a particular 
format (for example, printed text, ASCII file, hypertext file, etc.) according to the 
ultimate destination for the report For example, the exemplary financial text generation 
system 1 may be operated to produce a financial report 20 for dissemination over the 
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Internet, in which case it maybe desirable that the resulting financial report 20 be - 
formatted as a Hypertext Markup Language (HTML) document. 

An overview of an exemplary operation of the exemplary financial text generation system 
1 is illustrated by the flow chart of Figure 2. In step 50, the text generator 10 accesses the 
financial decisional statements 14 in memory system 11. Each financial decisional 
statement 14 may be comprised of, for example, at least one reference to financial data 
and a number of "result groupings" as defined by at least one "result group delimiter" 
listed in the decisional statement 14. The reference(s) to financial data will be used to 
calculate a result for a pre-defined base equation of the text generation system. The result 
groupings represent ranges (specified by the result group delimiters) into which the 
results of the base equation calculation will be segmented, and therefore the result 
groupings serve as distinct quanta to represent ranges of possible results of the base 
equation calculation using the financial data. 

These decisional statements 14 may be tailored to the type of financial data being 
analyzed. For example, a decisional statement 14 may be written to evaluate the return 
over the past year for a particular security. Such a decisional statement 14 might take the 
form: 

curprice,price 1 2mo, 1 020,30 
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Wter? "ci^rice^represente curreni pripe of the security, "price 12mo" represents the 
price of the security 12 months ago, and "10," "20;' and "30" are result group delimiters 
defining a number of result groupings for possible results from the calculation of the base 
equation using the "curprice" and "pricel2mo values". The specific components of an 
exemplary decisional statement for use with the exemplary text generation system 1 will 
be further described below. 

Each of the financial decisional statements 14 are evaluated by: accessing any raw 
financial values from the financial data 12 needed to resolve the references to financial 
data in the decisional statement 14 (step 52), computing a result for the base equation 
using these figures (step 54), and selecting a result grouping based on the equation result 
(step 55). The result grouping generated through the evaluation of the decisional 
statement 14 is used to determine a logic path (step 58) for generating appropriate text. 

Multiple financial decisional statements 14 may be grouped together, such that a logic 
path may be produced based on the evaluation of the multiple decisional statements 14 in 
combination. In such a case, each financial decisional statement 14 in a grouping is 
evaluated until all the decisional statements 14 of the group have been evaluated (step 
56), and then a logic path is specified based, for example, on the result groupings 
generated by each of the decisional statements 14 of the group (step 58). 

Once a logic path has been determined (step 58), the logic path is used to reference a 
collection of the text data 16 tailored to the analytic condition represented by the logic 
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path (step 60). This collection may include a range of possible phrasesappropriate for 
the analytic condition represented by the logic path, and text may then be selected from 
the collection to promote a nature! textual style (e.g., by random selection). Once the 
desired text is selected, it is written to an output (step 62) and the process is performed 
again for the next decisional statement 14 (or group of decisional statements 14), if any 
(step 64). Once all the decisional statements 14 have been evaluated, the report 20 is 
complete, and may then be disseminated in a desired fashion (print, e-mail, world wide 
web site, online service, etc.). 

A more detailed description of the exemplary financial report text generation system I 
according to the present invention will now be explained. Figure 3 depicts an exemplary 
financial data file 300 containing financial data for a fictitious company XYZ Corp. 
Individual elements of information — textual and numerical — are provided in financial 
data file 300 delimited by commas, although other known delineation techniques may be 
used. The financial data file 300 is formatted in a uniform manner to allow retrieval by 
the text generator 10 in the course of evaluating financial decisional statements 14. The 
contents of financial data file 300 may be stored in the memory system 1 1 (e.g., on a 
magnetic disk) to allow access by the text generator 10. 

The actual contents of the data file 300 may vary according to the type of report desired. 
Figure 3 illustrates a portion of the kind of data content that may be found in a financial 
data file 300 for XYZ Corp. Exemplary data file 300 includes the formal name 302 of 
the company ( H XYZ Corporation"), the colloquial name 304 of the company ("XYZ 
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Corp."), and the company's t^ng^teQQGX^YZ"), , Also provided. is the name of 
the industry sector 308 to which this company belongs ("Technology") and the industry 
group 310 within the sector ("Globally Established Software Companies"). In this case, 
the data file 300 is arranged such that the text generator 10 may determine the locations 
of the various data as needed (for example, using a data mapping file, or any other 
organizational technique as may be well known by those in the art). 

The data file 300 further includes typical financial statistics related to XYZ Corp., such as 
the current stock price 312 ("90.1 25"), the stock price from 1 year ago 314 ("60.000"), 
the industry peer group index value 3 1 6 ("60.2072"), the industry peer group index value 
from 1 year ago 3 1 8 ("43.4245"), the current S&P 500 Index Value 320 (" 1 1 1 1 .770") and 
the S&P 500 Index Value from 1 year ago 322 ("756.619"). Other values that may also 
be included are the gross revenue, earnings per share, P/E ratio and Average Qualitative 
Opinion (AQO) figures for XYZ Corp., as well as any other pertinent financial or 
corporate information. The content of the data file 300 is dependent on the type of report 
to be generated and specifically on the data required by the financial decisional 
statements and text data used to compose the desired report. 

Figure 4 shows a portion of the contents of an exemplary financial decisional statement 
file 400 - also referred to colloquially as a "map" file. The statement file 400 is 
sometimes called a "map" file because it provides the "roadmap" of how to generate a 
desired report The contents of exemplary statement file 400 may be stored in the 
memory system 1 1 (e.g., on a magnetic disk) to allow access by the text generator 1 0. 
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En ^.!^ x ^ 400 may be delimited by, forexample,null 

characters, or any other well known format that allows the text generator 10 to retrieve 
individual entries. 



Exemplary decisional statement file 400 is grouped by labels 402 (e.g., "PRICE02", 
"PRICE03", TRICE04") used to identify the different pieces of text that may be needed 
in order to generated the desired report. Beneath each label 402 is a statement count 404 
that indicates the number of decisional statements 406 grouped underneath each label 
402. For example, three decisional statements 406 are grouped under label PRICE03, as 
indicated by the numeral "3" in Figure 4. 



Each exemplary decisional statement 406 includes two operands 408, 410, followed by at 
least one result group delimiter 414 used to indicate the start and end of each result 
grouping. The operands 408, 410 are used during text generation to evaluate a base 
equation used by the text generator 10 to determine logical relationships. Although many 
types of equations may be used as possible base equations, the exemplary embodiment of 
the financial text generation system 1 uses the following equation as the base equation: 



Operand 1 

f=- x 100 

Operandi 



This equation is useful in the present example of a financial report generator because it 
allows for the computation of ratios and percentages that are common in financial 
analysis. 
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*? tthe "umber of operands inclined in each decisional statement 406 is dependent 

on the type of base equation used. For example, if the base equation were specified as 
follows 

Operandi 

which requires only one operand, then the decisional statements used with this base 
equation need only include one operand. In the presently described embodiment, the 
base equation uses two operands, and therefore each decisional statement 406 includes 
two operands 408,410. 

The result of the calculation of the base equation using the operands 408,410 is then 
compared with the result group delimiters 414 to determine into which result grouping 
the result fits. For example, given the simple decisional statement 406: 

1,2,25,50, 75, 100 

the operands 408,410 are U P amd "2", and the result group delimiters 414 are "25 " "50," 
"75," and "100." These result group delimiters 414 define five result groupings, which 
may given letter designations for ease of description: 

result grouping "A" x <= 25 
result grouping "B" -5> 25 < x <= 50 



Express Mail No: EL088530266US 



CA 02294662 2000-01-10 
15 

i^ul^groupmg 75 .... 

result grouping "D" -> 75 < x <= 100 
result grouping "E" -> 100 <x 

where *x" is the result of the base equation calculated using the operands 408,410. 
Thus, the execution of this simple exemplary decisional statement 406 would be as 
follows: 

- x 100 = 50 -* result B ("50") 
2 ' 

Operands 408, 410 may be either numerics or variables (e.g., "cocurprice", 
"copricel2mo'\ etc.), and may include embedded arithmetic operations (e.g., addition, 
division, etc.). Where a decisional statement 406 uses a variable name, the value for the 
variable is retrieved during text generation from the value for that variable in, for 
example, the financial data file 300 (see Figure 3). 

As mentioned previously, each decisional statement 406, when evaluated, results in the 
determination of a specific result grouping (sometimes referred to colloquially as a 
"bucket") based on the results of the base equation calculation. These groupings may be 
conveniently referred to in terms of letter designations: i.e., the first result grouping is 
result H A n , the second result grouping is result "B", and so forth. In the previous simple 
example, the decisional statement evaluated to the second result grouping, result "B." 
The different result groupings are defined by the result group delimiters 414, which 
indicate the upper and/or lower limits of each result grouping. The use of these distinct 
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quanta (classifications) for result values allows Jogic paths to be developed to deal with a 
wide variety of outcomes, as well as for groupings of multiple decisional statements 406, 
as will be further discussed below. 



The use of the decisional statements 406 to generate logic paths may be further described 
through the following example. A decisional statement is specified to determine the 
performance of a corporation (as represented by earnings per share) during the current 
fiscal year as compared to the previous fiscal year: 



EPScurl2mo, EPSprevl2mo, 80, 90, 97,102, 1 10, 120 

The decisional statement above specifies two operands - earnings per share for the 
current twelve months and earnings per share for the previous twelve months - that will 
be used to compute a result for the base equation, and which will then be applied to 
determine the appropriate result grouping. The decisional statement given above also 
includes seven result groupings (A3,C,DJEJ\G) defined by six result group delimiters 
(80, 90, 97, 102, 1 10, 120) specified in the decisional statement. 



Upon evaluation of this decisional statement during text generation, the text generator 
will compute the following equation: 



EPScUr,2m ° x 100 = RESULT 



EPSprevl2mo 
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The text generator^ grouping. using the result group 

delimiters: 

If RESULT <= 80, the decisional statement outcome is "A" 
If 80 < RESULT <= 90, the decisional statement outcome is " B"; 
If 90 < RESULT <= 97 the decisional statement outcome is "C M ; 
If 97 < RESULT <= 102, the decisional statement outcome is " D"; 
If 1 02 < RESULT <= 1 1 0, the decisional statement outcome is " E"; 
If 1 1 0 < RESULT <= 1 20, the decisional statement outcome is " F ; 
If 120 < RESULT, the decisional statement outcome is "G". 

A logic path would then be established based on the result grouping. As will be further 
described below, the text generated for the logic path based on cases A-G may then be 
tailored to the particular conclusion to be reached from each result grouping. For 
example, if the result for the above decisional statement example was M A H , that would 
indicate a very poor current performance relative to the previous fiscal year, and the 
resulting text may reflect such a bad result Likewise if the outcome of the above 
decisional statement was "G w , that would indicate an exceptional performance, and the 
resulting text may be tailored to reflect such a good result 



A beneficial result of the use of distinct result groupings is that the text generator may 
easily process non-linear result groupings, results approaching or equal to zero, or results 
that involve negative values. For example, in the above decisional statement example, 
the result group delimiters of the decisional statement reflect a non-linear distribution of 
result groupings, which allows the text generator to select text that better matches the 
desired interpretation of the data. 
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- . As ; ^pth^,ej^mple ofthe use of decisional statements, the following group of statements ~ 
compares the earnings of a fictitious company DEF Co. for the present quarter (curqEPS) 
to the earnings for the previous quarter (prevqEPS): 

curqEPS, 1,0 
prevqEPS, 1,0 

(curqEPS+0.0001),^ 

(curqEPS+0.0001MprevqEPS+0.0^^ 

The first decisional statement determines whether current quarter EPS for DEF Co. is 
negative, zero or positive by computing the base equation as follows: 

curqEPS 
— ^ xlOO 

If curqEPS is negative or zero, the result grouping will be "A" while if curqEPS is 
positive, the result grouping will be The next decisional statement determines 
whether the previous quarter EPS for DEF Co. (prevqEPS) was negative, zero or positive 
in a similar manner. The logic path generated for this group of decisional statements will 
therefore reflect the sign of the earnings values. 

The third decisional statement computes a comparison between the current quarter EPS 
figure and the previous quarter EPS figure: 

curqEPS + 0.000! %gxex 

- xlOO 

prevqEPS +0.0001 
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In order to account for division-by-zero problems, a miniscule amount (relative to the 
values being measured) is added to each variable for which a zero value is possible. The 
presence of zero earnings in either quarter in this example would be indicated by the base 
equation producing an abnormally large result or an abnormally small result: 



if curqEPS = 0, then base equation < 0.01 
if prevqEPS=0, then base equation >100 



The result groupings of the third decisional statement reflect these possibilities: a result 
"A" will be returned for a base equation <=0.01, and a result T will be returned for a 
base equation >100. Note that if the current and previous quarter earnings are both 
positive non-zero numbers, the third decisional statement will yield a result that indicates 
the percentage earnings increase/decrease from the previous quarter to the current 
quarter. Note also that if one (but not both) of the earnings values is negative, the third 
decisional equation will produce a result "A/* as the computation of the base equation 
will yield a negative number. 

The fourth decisional statement determines a subtraction measurement where a 
percentage measurement is inappropriate (e.g., earnings transition from negative to 
positive): 

(curqEPS + 0.000 1 ) - (prevqEPS + 0.0001 )) _ 
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The result groupings for this fourth decisional statement reflect a transition from positive 
to negative earnings (base equation < 0) and a transition from negative to positive 
earnings (base equation > 0). 

After these four decisional statements have been evaluated, the logic path created from 
the result groupings from each decisional statement will indicate the type of text analysis 
required. For example, if DEF Co. had previous quarter earnings of $-0.20 per share and 
current quarter earnings of $-K).12 per share, the four decisional statements would 
evaluate as follows: 

(1) curqEPS,l,0 -3> result "B" 

(2) prevqEPS,l,0 -5> result "A" 

(3) (curqEPS+0.0001), (prevqEPS^.0001)A01,0.^ -3> 

result "A" 

(4) (cuniEPS^.0001)^revqEPS^^ -5> result "I" 

A logic path of BAAI would be generated, and text would be selected based on this logic 
path to reflect that the earnings of DEF Co. had transitioned from negative to positive, 
and had done so by between 20 and 40 cents per share. 

The flexibility of the decisional statement format allows decisional statements to be 
created to handle a wide range of analytical situations, similar to the previous example. 
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igure.5. illustrates a portion of an exemplary text source file 500. Text source file 500 is 
used to store the possible text words, phrases, sentence fragments and sentence and 
paragraph templates to be used to generate the text for a desired report. Exemplary text 
source file 500 (which may also be referred to colloquially as a "mat" file) as depicted in 
Figure 5 is a text source file for a report on the fictitious XYZ Corporation as described 
earlier (filename "XYZ.mat"). The contents of exemplary text source file 500 may be 
stored in the memory system 1 1 (e.g., on a magnetic disk) to allow access by the text 
generator 10. Entries in the exemplary text source file 500 may be delimited by, for 
example, null characters, or any other well known format that allows the text generator 
10 to retrieve individual entries. 

Exemplary text source file 500 is arranged hierarchically into collections of text 
according to the labels used in the associated map file 400 (see Figure 4) and the logic 
paths generated by the decisional statements grouped under those labels. For example, in 
Figure 5, a text file identifier 502 - TRICE03LED" -- indicates that the file entries 
following this identifier correspond to the label "PRICE03" used in decisional statement 
file 400 and the logic path "LED" (described further below). Other labeling systems are 
possible, although the aforementioned format has the benefit being easily understandable 
for debugging purposes. 

The format of exemplary text source file 500 will now be explained. Following each text 
file identifier 502 is an index value 504 that indicates the number of sub-entries that are 
associated with the particular text file identifier 502. For example, as shown in Figure 5, 
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. four sub-entriesare associated-with the text file identifier 502 n PRICE03LED", as * ^ 

reflected by the numeric n 4" on the line beneath the M PRICE03LED" identifier. 

Associated with each text file identifier 502 is at least one sub-entry. The sub-entry will 
5 be at least one text variable 506, which will further include a number of text fragments 
508 that may be assigned to that text variable 506 during text generation. The text 
fragments 508 associated with a particular text variable 506 may be grouped into multiple 
"text sets", for example, in order to provide different verb tenses or other grammatical 
variations. To accomplish this, the text variable 506 will also use text variable indices 
i 0 5 1 0, 5 1 2, which denote the number of text sets associated with the text variable 506, and 
the number of text fragments 508 associated with each text set, respectively. The use of 
text sets allows for the use of, for example, a combination of grammatically consistent 
terms, and helps prevent the repeated use of the same word in the same block of text 

1 5 Referring now specifically to the example given of producing a report for XYZ Corp., the 
text file identifier PRICE03LED is associated with four text variables 506 - "GIVE", 
"WHAT", "RATE", and TEXT". The text variable GIVE has seven text sets (indicated 
by grouping index 510), each of which includes one text fragment 508 (indicated by text 
fragment index 512). The text variable WHAT has two text sets, with three text 

20 fragments 508 in each set 

The text variable TEXT illustrates a nesting feature of the exemplary text generation 
system according to the present invention. The text variable TEXT is a special case, in 
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,*^text!?P!Ti^ any text.file identifier 502 begins with the evaluation of the 

TEXT variable for that text file identifier. The text fragments listed under the TEXT 
variable thus may serve as a templates for generating text related to the text file identifier 
502, and every text file identifier 502 will include one sub-entry that is a TEXT-type 
variable. 

In the example shown in Figure 5, the text variable TEXT includes three text sets of one 
text fragment 508 per set However, within each text fragment 508 is a text variable 
reference 5 14 to another text variable 506 associated with the text file identifier 502 
"PRICE03LED". For example, the first text set of the TEXT variable of Figure 5 
includes the following text: 

Based on this exceptional return, the stock has \GIVE1 a performance . . . 

The single V operator may be used to indicate that the following word is a text variable 
reference 5 1 4. The presence of the text variable reference 5 1 4 indicates to the text 
generator that the variable referenced should be evaluated to determine its contents, and 
an appropriate text fragment 508 should be inserted into the text fragment 508 at the 
location of the text variable reference 514. The numeric index designation appended to 
the end of the text variable reference 514 indicates the particular text fragment 508 within 
the selected text set that should be used at the location specified by the text variable 
reference 514. 
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When a text variable isreferencedv the text-generator processes the text variable by 
selecting a text set from the group of text sets associated \yith the text variable. For 
example, when the "\GIVE1" reference 514 is encountered, the text generator would 
process the GIVE text variable by selecting one of the seven text sets associated with the 
GIVE variable. The text generator may use any known method for selecting a text set - 
for example, a random selection process using a random number generation technique. 
The number index appended to the end of the text variable reference 514 - in this 
example, the "P in "\GIVEr - instructs the text generator to use the first text fragment 
508 listed in the selected text set. 

Each text fragment 508 may also include a data variable reference 5 1 6. The data variable 
reference 516 is indicated, for example, by double T operators prior to a variable name 
that corresponds, for example, to a field in the financial data file 300. Referring 
specifically to Figure 5, the "WCGname* data variable reference 516 in the text fragments 
508 for the TEXT variable ( . . than of the WCGname peer group.'*) is a reference to the 
company's industry group within the industry sector (which, in the case of the XYZ 
Corporation example, is the "Globally Established Software Companies" - see Figure 3). 
During text generation, the summary generator substitutes the item in data file 300 
corresponding to the CGname variable into the location in text fragment 508 where 
"WCGname" is placed: 

. . . than of the Globally Established Software Companies peer group. 
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J^xt generator lamay be used to synthesize the text of the desired reporf based on Hie 
financial data file 300, the decisional statement file 400 and the text data file 500. Text 
generator 10 may include, for example, one or more C language programs that can access 
the memory system 1 1 that provides the financial data, decisional statements and text 
data, and process the decisional statements and text fragments, according to methods well 
known to those in the art. Such processing may be done via batch processing, real-time 
processing, or another well known scheme. 

The operation of the exemplary embodiment of the present invention will now be further 
explained with reference to a hypothetical report generation session for the XYZ 
Corporation discussed above. As such, the financial data for XYZ Corp. are provided by 
financial data file 300 which includes those data items illustrated in Figure 3, and the 
financial decisional statements for the report to be generated on XYZ Corp. are provided 
by the map file 400, which includes the decisional statements illustrated by Figure 4. The 
text data for the report to be generated on XYZ Corp. are provided by text source file 
500, which includes the entries illustrated by Figure 5. 

Figure 6 depicts a portion of an output listing file 600 that tracks the operation of text 
generator 10 on the financial data file 300, map file 400 and text source file 500 for XYZ 
Corporation. Figure 6 is limited for purposes of clarity to only that portion of output file 
600 that reflects the evaluation of the decisional statements associated with the PRICE03 
label of map file 400. Text generator 10 may produce output listing file 600, for 
example, for debugging purposes. Figure 7 illustrates a detailed flow chart describing the 
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operations of text generator =10 in preparing the desired report: In the course of 
processing map file 400, the text generator 10 initially reads the label PRICE03 from map 
file 400 (step 702), which identifies a group of decisional statements to evaluate. The 
text generator 10 then reads the statement count 404 (step 704), which indicates that three 
decisional statements need to be evaluated in order to resolve the logic path for the 
PRICE03 label. The text generator 10 then reads the first decisional statement 406 
associated with the PRICE03 label from the map file 400 (step 705): 

cocurprice,copricel2mo,50,75,85,95,99,101,105,l 10,1 15,125,135,150 

The text generator locates the operands within the first decisional statement 406 - 
"cocurprice" (the company current stock price) and w copricel2mo M (the company stock 
price 12 months earlier) — and determines values for these operands by retrieving the 
appropriate values from the financial data file 300 (step 706). This step is indicated by 
items 602 and 603 of the output listing file 600, which show that the value for cocurprice 
is determined to be "90. 125000" and the value for copricel2mo is determined to be 
"62.000000." 

The operands of the first decisional statement are then applied to the base equation, and a 
result of this calculation is produced (step 708). The evaluation of the base equation 
using the operands yields "145.362903", as reflected by item 604 of output file 600. Note 
that this computation effectively computes the one year return on the securities of XYZ 
Corp. (ignoring dividends). The computation result is then compared to the result 
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result grouping in which the result belongs (step 710). As shown by item 606 of output 
file 600, the "145 .362903" result is compared to each result grouping, beginning with 
group "A," until the text generator 10 determines that the correct classification is group 
"L." At this point, the result grouping is stored for later use in determining the logic path 
(step 712). 

Since the PRICE03 label is associated with more than one decisional statement (step 
714), the second decisional statement associated with the PRICE03 label is read by the 
text generator 10 (step 705): 

cocurprice/coprice 1 2mo,curgrpprice/grp 1 2moprice,80,90,98, 1 02, 1 1 0, 1 20 

In the case of this second decisional statement, the first operand is actually the result of a 
division of the variables "cocurprice" and "copricenmo." Thus, in step 706, the text 
generator 10 retrieves the values for these variables from the data file 300, and calculates 
the results of the "embedded" division operation. The evaluation of the first operand is 
reflected by item 608 of the output file 600. Similar processing is performed to evaluate 
the second operand of the second decisional statement, as this operand also specifies a 
division of two variables ("curgrpprice" and w grpl2moprice M ). The evaluation of the 
second operand is reflected by item 609 of the output file 600. 
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, The.operands of the second decisional statement are then used to compute the f^stilt of 
the base equation (step 708) to yield "104.843133", as reflected by item 610 of output file 
600. This computation effectively computes the one year return on the securities of X YZ 
Corp. relative to the one year return of the industry group index. The computation result 
is then compared to the result grouping delimiters 414 of the second decisional statement 
to determine the appropriate result grouping (step 710). As shown by item 612 of output 
file 600, the "104.843133" result is compared to each result grouping, beginning with 
group "A," until the text generator 10 determines that the correct classification is group 
"E." At this point, the result grouping is stored for later use in determining the logic path 
(step 712). 

The third decisional statement associated with the PRICE03 label then is read by the text 
generator 10 (step 705): 

cocurprice/copricel 2mo,cursp500price/sp5001 2moprice,80,90,98, 1 02, 1 10, 1 20 

As was the case for the second decisional statement, the operands of the third decisional 
statement also call for the division of variables to resolve the desired operands. Thus, in 
step 706, the text generator 10 retrieves the values for these variables from the data file 
300, and calculates the results of the desired divisions. The evaluation of the first 
operand is reflected by item 614 of the output file 600, and the evaluation of the second 
operand is reflected by item 61 5 of the output file 600. 
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_ The operands of third decisional ' statement are then applied to the base equation, with a 
result of "98.927338" (step 708), as reflected by item 616 of output file 600. This 
computation effectively computes the one year return on the securities of XYZ Corp. 
relative to the one year return of the S&P 500 index. The computation result is then 
compared to the result grouping delimiters 414 of the third decisional statement to 
determine the appropriate result grouping (step 710). As shown by item 618 of output 
file 600, the "98.927238" result is classified as a group "D" result. This result grouping is 
stored for later use in determining the logic path (step 712). 

Once the three decisional statements associated with the PRICE03 label have been 
evaluated, the logic path outcome of the equations is determined to be "L" + "E" + "D", 
based on the combination of the result groupings produced by the three decisional 
statements (step 715). The text generator 10 parses the text source file 500 for the text 
file identifier 502 that corresponds to the PRICE03 label and the "LED" logic path - in 
the present example, the identifier "PRICE03LED " (step 716). Note that there may be 
numerous text file identifiers 502 in text source file 500 related to the PRICE03 label that 
correspond to alternate logic paths (e.g., "PRICE03AAA," "PRICE03BBB," and so 
forth). 

Once found, the text generator 10 reads the index value 504 for the PRICE03LED text 
file identifier 502, and locates the TEXT text variable sub-entry (step 718). The text sets 
associated with the TEXT text variable are identified and selected, for example, by a 
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. random, selection process. Jn the-case of the present example, the second text set is 
selected containing the following text fragment 508: 

With this exceptional return, the stock has \GIVE1 a 
performance \RATE1 achieved by the S&P 500 index and 
\WHAT2 one than of the WCGname peer group. 

In step 720, the text variable references 514 and data variable references 516 are resolved 
for the selected TEXT text fragment 508. The text generator 10 parses the selected text 
fragment 508 to find each variable reference (step 722). If the variable reference is 
determined to be a data variable reference 516 (step 724), the appropriate financial data 
value is read from the financial data file 300 (step 726). For example, once the text 
generator 10 finds the data variable reference "WCGname", the corresponding data file 
value ("Globally Established Software Companies") is read from the data file 300, 
reflected by item 619 in the output file 600. If the variable reference is determined to be 
a text variable reference 514, the text variable is located in the text source file 500 (step 
728) and a text fragment 508 is selected from the text fragments associated with the text 
variable (step 730). For example, when the text generator 10 finds the text variable 
reference "\GIVEP, the "GIVE" text variable is located in the text source file 500, and 
the number of text sets is identified as seven, based on the text set index 510 (step 728). 
A text fragment is then selected by, for example, randomly selecting one of the seven text 
sets, and selecting the first text fragment 508 listed in the selected text set, because the 
"P in ^GIVEP specifies the first text fragment of the set (step 730). 
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Once all of the variable references 514, 516 have been resolved (step 732), the text 
associated with the PRICE03LED logic outcome is composed and stored for use in the 
report (step 734), as reflected by item 620 of the output file 600: 



With this exceptional return, the stock has PROVIDED a 
performance MEETING THE RETURN achieved by the 
SAP 500 index and A STRONGER one than of the 
GLOBALLY ESTABLISHED SOFTWARE 
COMPANIES peer group. 



where the capitalized text is the text determined by processing the variable references at 
runtime. 



As mentioned previously, according to the exemplary systems and methods described 
above, an embodiment of the present invention may be implemented and employed to 
quickly generate textual reports for financial (or other) data in a flexible manner that 
accounts for real-world analytical situations and produces human-quality text Reports 
may be generated in real-time and delivered in real-time, depending upon the speed and 
capacity of the text generation system and the desired delivery method Large numbers of 
reports can be generated in a short period of time and, for reports which use a common 
format, multiple reports may be generated from a single set of decisional statements and 
text data, reducing overhead 
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In the preceding specification, the present invention has been described with reference to 
specific exemplary embodiments thereof. It will, however, be evident that various 
modifications and changes may be made thereunto without departing from the broader 
spirit and scope of the present invention as set forth in the claims that follow. The 
specification and drawings are accordingly to be regarded in an illustrative rather than 
restrictive sense. 
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What is claimed is: ....... > w ^.^ . ... 

1 . A text generation system, comprising: 

a memory system storing raw data, decisional statements and text data, the 
decisional statements including a first operand and result groupings; and 

a text generator operatively coupled to the memory system and producing a report 
using the number of decisional statements, the raw data and the text data. 

2. The system of claim 1 , wherein each of the decisional statements further includes 
a second operand. 

3. The system of claim 2, wherein the text generator includes a base equation, and 
the wherein the text generator calculates a result of the base equation for at least one of 
the decisional statements, using the first operand of the at least one decisional statement 
and the second operand of the at least one decisional statement 

4. The system of claim 3, wherein the base equation includes a division operation on 
the first operand and the second operand. 

5. The system of claim 4, wherein the text generator selects one of the result 
groupings of the at least one decisional statement based on the result of the base equation. 

6. The system of claim 1 , wherein the text data includes text fragments. 
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7. The system of claim 6, wherein the text generator evaluates at least one of the 
decisional statements to determine a logic path. 

8. The system of claim 7„ wherein the text generator selects at least one of the text 
fragments for the report as a function of the logic path. 

9. The system of claim 6, wherein a first one of the text fragments includes a 
variable reference. 

10. The system of claim 9, wherein the variable reference is a text variable reference. 

1 1 . The system of claim 10, wherein the text generator resolves the text variable 
reference by replacing the text variable reference with a second one of the text fragments. 

1 2. The system of claim 1 1 , wherein the text generator further includes a random 
number generator, and the text generator selects the second one of the text fragments as a 
function of a random number generated by the random number generator. 

13. The system of claim 12, wherein the text generator selects the second one of the 
text fragments as a function of an index number. 

14. The system of claim 9, wherein the variable reference is a data variable reference. 
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1 5. The system of claim 14. wherein the text generator resolves the data variable 
reference by replacing the data variable reference with a data file entry corresponding to 
the data variable reference. 

16. The system of claim 6, wherein the text fragments are organized as text sets. 

1 7. The system of claim 16, wherein the text generator evaluates at least one of the 
decisional statements to determine a logic path and the text generator selects at least one 
text set as a function of the logic path. 

1 8. The system of claim 1 7, wherein the text generator further includes a random 
number generator, and the text generator selects the at least one text set as a function of a 
random number generated by the random number generator. 

1 9. The system of claim 1 8„ wherein the text generator selects at least one text 
fragment from the at least one text set 

20. The system of claim 1 , wherein the report is a financial report 

21. A method of text generation, comprising the steps of: 

evaluating a base equation using a first operand and a second operand of a 
decisional statement to obtain an equation result; 
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selecting one result grouping out of a number of result groupings of the decisional 
statement, based on the equation result; 

determining a logic path based on the one result grouping; 

selecting a first text fragment from a text data file located in a memory system 
based on the logic path; and 

generating an output including the first text fragment. 

22. The method of claim 21, further comprising the steps of: 

reading the decisional statement from a decisional statement file stored in the 
memory system; and 

determining a value for at least one of the first operand and the second operand by 
accessing a data file located in the memory system. 

23. The method of claim 21, wherein the selection of the first text fragment includes 
the steps of: 

selecting a text collection from the text file based on the logic path, the text 
collection including the first text fragment; and 

selecting the first text fragment from the text collection. 

24. The method of claim 23, wherein the first text fragment includes at least one text 
variable reference to a text variable, and further comprising the step of resolving the at 
least one text variable reference. 
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25:- Themethod of clainv24, wherein the step of resolving the text viable reference 
includes the steps of: 

finding a number of text sets associated with the text variable in the text file; 

randomly selecting a text set from the number of text sets associated with the text 
variable, the selected text set including a number of text fragments; 

selecting a second text fragment from the number of text fragments; and 

replacing the text variable reference with the second text fragment. 

26. The method of claim 23, wherein the at least one text fragment includes at least 
one data variable reference to a data variable, and further comprising the steps of: 

accessing a data file located in the memory system; 

retrieving a data file entry corresponding to the data variable; and 

replacing the at least one data variable reference with the data file entry. 

27. A storage medium, comprising: 

a set of instructions that, when accessed by a processor, instruct the processor to 
perform the steps of: 

evaluating a base equation using a first operand and a second operand of a 
decisional statement to obtain an equation result; 

selecting one result grouping out of a number of result groupings of the decisional 
statement, based on the equation result; 

determining a logic path based on the one result grouping; 
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selecting a text fragment from* a text data file Ideated in a immbiy system B^d^ 
on the logic path; and 

generating an output comprising the text fragment. 

28. A text generation system, comprising: 

means for evaluating a base equation using a first operand and a second operand 
of a decisional statement to obtain an equation result; 

means for selecting one result grouping out of a number of result groupings of the 
decisional statement, based on the equation result; 

means for determining a logic path based on the one result grouping; 

means for selecting a text fragment from a text data file stored in a memory 
system based on the logic path; and 

means for generating an output including the text fragment 

Smart & Biggar 
Ottawa, Canada 
Patent Agents 
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